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We show the capacity of a generalized pulse-position modulation (PPM) channel, 
where the input vectors may be any set that allows a transitive group of coordinate 
permutations, is achieved by a uniform input distribution. We derive a simple 
expression in terms of the Kullback-Leibler distance for the binary case, and find 
the asymptote in the PPM order. We prove a sub-additivity result for the PPM 
channel and use it to show PPM capacity is monotonic in the order. 


I. Introduction 

NASA is currently developing the first operational deep-space optical communications link for launch 
on the Mars Telesat Orbiter in 2009. 2 The deep-space optical channel is well modeled as memoryless 
and operates efficiently at large peak-to-average power ratios, which may be efficiently implemented with 
pulse-position modulation (PPM) [1,2], in which each channel symbol is a unit vector. PPM satisfies 
the property that each symbol may be obtained as a permutation of the coordinates of another. We 
consider a generalization of this, where the input vectors may be any set that allows a transitive group 
of coordinate permutations. We derive an expression for the capacity of this generalized PPM channel in 
the binary case, and examine the behavior of the capacity of the PPM channel as a function of the PPM 
order. 

In Section II we show that, for a memoryless generalized PPM channel, capacity is achieved with 
equiprobable inputs. We show that a simple expression for the capacity follows for the binary case, and 
illustrate the asymptotic behavior of the PPM channel as the PPM size tends to infinity. In Section III 
we prove a sub-additivity result for the PPM channel and show that certain monotonic behavior follows. 


II. Capacity of Generalized PPM 

We use X, Y to denote random variables and x, y their realizations. Similarly, we let X, Y and x, y 
denote n- vectors of random variables and their realizations. Let Py\x(v\ x ) be the conditional density (or 
probability mass) function of a memoryless channel, and PY|x(y| x ) its ?rth extension. When it’s clear 
from the context, we simply write p(y\x) or p(y|x) for Py\x(d\ x ) and PY|x(y| x )- 
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Let S = {xi, X 2 , • • • , x s } be a set of length n vectors, px(') a probability distribution on S, and /(X; Y) 
the mutual information between X and Y. The S'-capacity of the channel is defined as 


C s = max /(X: Y) 

Px 


i.e., the capacity with inputs restricted to S. Let G be a group of coordinate permutations that fix S, 
i.e. , such that for each g £ G, a g S = S, where a g is the mapping imposed by g. If, in addition, G acts 
transitively on S, then we call S a transitive set. ( G acts transitively on S if for each x,,Xj £ S there 
exists g £ G such that x.j = a g (xj).) 

The capacity of a channel whose input is a transitive set follows from the well-known fact that /(X; Y) 
is convex-fl in the input distribution p [3, Theorem 4.4.2]. 

Theorem 1. If S is a transitive set, then Cs is achieved by a uniform distribution on S. 

Proof. Let p be a distribution on S and for g £ G let p 9 be given by p 9 (x.i) = p(crg(x,;)). Clearly 
any p 9 produces the same mutual information as p. Thus by Jensen’s inequality, (1/|G|) 'f2 g€ QP 9 yields 
mutual information greater than or equal to that yielded by p. But this new distribution is simply the 
uniform distribution: for any Xj we have 


1 1 g&G 1 1 g£G 


and as g ranges over G, cr g (x i) ranges over each element of S the same number of times (by the Orbit 
Stabilizer Theorem, e.g., [4, Theorem 8.2]); thus the above quantity is equal to 1/s. □ 

A. Binary Inputs 

With binary inputs, Cs reduces to a simple expression. Let the input alphabet be {0,1}, and 2o(x) 
and Xi(x) the collection of indices of the 0’s and l’s in x, respectively. For example, Xo(101) = 2, 
Xi(101) = {1,3}. Let JVi = |Zi(x)|, a constant for each x £ S, and D(-||-) be the Kullback-Leibler 
distance. 

Theorem 2. On a binary input channel with p(y\l) / p(y\0) < oo, 


C s = N 1 D(p(y\l)\\p(y\0)) - £>(p(y)||p(y|0)) 


Proof. With equiprobable inputs from Theorem 1, we have 
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Cs = J p{, y|xi)log 2 d y 


= / p(y|xi)log 2 , y 


= J p(y|xi)log 2 


Uwiw,^ 

p( y) 


dy 


= / p(y|xi)log 2 ]1 


i ieii(xi 


P(Vi | 0 ) 


dy -I piy)lo& {^m) dy 


= N 1 D(p(y\l)\\p(y\0)) - D(p(y)\\p(y\0)) 


n 


B. Pulse-Position Modulation 

In the remainder, we investigate the behavior of Cs as a function of n for the PPM channel. To that 
end, let I(n) be the capacity of a memoryless channel with PPM inputs of length ?r. We first treat the 
case in Theorem 2 where p(y\\) / p(y\C) is unbounded. 

Let U and A be the collections of unambiguous and ambiguous outputs when x = 1 is transmitted: 


U = {y\p{y |!) yf 0,p(y|0) = 0} 


A = {y\p(y I 1 ) > 0,p(y|0) ^ 0} 


If any coordinate of y belongs to U, the input will be known with certainty. In order to treat the 
ambiguous and unambiguous outputs separately, define a reduced channel p*(y\x) with output y £ A as 
follows: 


p*(y |o) =p{y | 0 ) 


P*(y | 1 ) 


p(y I 1 ) 
p(A\l) 


where p{A\l) = J A p(y\l)dy. Let 7*(X; Y) and I*(n) be the mutual information and capacity of the 
reduced channel, in bits per PPM symbol. 

Lemma 1. 


I{n) = p(U\l) log 2 n + p{A\\)I*(n) 
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Proof. Let x y be the symbol with li(xj) = j (a 1 in position j ) , and U n (xj ) and A n ( Xj ) the collections 
of unambiguous and ambiguous outputs when Xj is transmitted: 


U n (xj) = {y\yj G U} 

A n{Xj) = {y| Vj i U} 

Let U = U n {x),A = lJ xeS H„(x). Introduce a binary random variable Z as follows: 

7 _ /O, Y G U 
Z “\l, Yci 

Since each x contains exactly one nonzero entry, P(Z = 0) = _p(C7| 1) and P(Z = 1) = p(A\\). Note that 
p(x|z) =p(x); hence, H(X.\Z) = H(X) and 


/(X;Y)=/(X;Y,Z) 

= 7(X;Z) + J(X;Y|Z) 


= P{Z = 0 )/(X; Y|Z = 0) + P{Z = 1) J(X; Y|Z = 1) 


= p(U\l) log 2 n + p(A\l)I* (X; Y) 

The lemma follows since the capacity achieving input distribution is uniform for both channels. □ 

Hence we can decompose the n-ary PPM channel into an unambiguous channel, which contributes 
p(U |1) log 2 n to the capacity, and a reduced channel, with transition probabilities p*(y\x). In the remain- 
der, we assume the channel is reduced, which allows a simple corollary of Theorem 2. 

Corollary 1. For the reduced binary PPM channel, I(n ) = D(p(y\l)\\p(y\0)) — -D(p(y)||p(y|0)). 

Corollary 1 allows a straightforward proof of the asymptotic behavior of the memoryless PPM channel. 

Theorem 3. lim ^^/(n) = D (p(y\l)\\p(y\0)) . 

Proof. Let 1 denote the unit vector with a 1 in the first position: 

o < £>(p(y)lb(yl°)) 


= E-y log 2 


' p( y) ' 


= e y log 2 


ELiP(yl x »M x 0 

p(y|o) 


= e y log 2 


ifpJl 

n “ p(Yi | 0 ) 
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where the last inequality follows since 


E 


•Yi |X 4 =0 


P(Yi |1) 


.p(Xi | 0 ) 


y:p(y\0)>0 


p(y |1) dy< 1 


for t yt 1 . For a reduced channel there exists a constant /v such that 


E- 


Yi\Xi=l 


>(yj|i) 

p(n|0)J 


< K 


hence, 


0 < lim £>(p(y)||p(y|0)) < lim log 2 ( — + - — - ) =0 


n—>oo \ n n 


III. Capacity Inequalities 

Theorem 4. If n < m, then I(kn ) — I(n) > I(km ) — /(to). 

This theorem says that if we multiply the number of slots by k, the increase in PPM capacity will 
be larger if the original number of slots was smaller. The conclusion can be equivalently stated as 
I(kn) + I(m) > I (km) + I (n). 

Proof. Let Z Z rn , and Z n be random variables uniformly distributed on {1, • • • , fc}, {1, • • • , to}, and 
{1, . . . ,n}, respectively. Let Y n be the (random) output vector when Z n drives an n-PPM channel (that 
is, the input to the channel is an n- vector with a 1 in position Z n and zeros elsewhere). Similarly, let 
Y kn and be the output vectors when the ordered pairs (Zk,Z n ) and (Zk,Z m ) drive fcn-PPM and 
fem-PPM channels, respectively. In these two cases, it is useful conceptually to regard the slots as being 
arranged in a rectangular grid, with, for example, ( Z Z n ) specifying the column and row of the 1. 

Observe that H(Z n \Zk, Yj, n ) = H(Z n |Y„), because in the left-hand side Z specifies the column of 
n slots in which the 1 is “hiding”; thus the other slots can be ignored. We therefore have 


H(Z k ,Z n \Y kn ) = H(Z n \Z k ,Y kn )+H(Z k \Y kn ) 


= H(Z n \Y n ) + H(Z k \Y kn ) 


We then have 


I(kn) - I(n ) = log 2 kn - H(Z k , Z n \Y kn ) - log 2 n + H(Z n \Y n ) 


= log 2 k - H(Z k \Y kn ) 


Similarly, I (km) - I(m ) = log 2 k - H(Z k \Y km ). 
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All that remains is to show that H(Zk\Y km) > H(Zk |Yfc n ). Intuitively, this is clear because there 
is more ambiguity about which column the 1 is in when there are more rows. A more formal line of 
reasoning would involve introducing side information W in the fcm-PPM case that specifies a random set 
of n rows, one of which contains the 1. Then H(Zk\Y k m ) > H(Zk\Y k m , W) = H(Zk\Y k n )- □ 

Theorem 4 has the immediate consequence that if ?7ii?7i2 = ni?i 2 and mi + m 2 < rq + 712 , then 
/(mi) + I (m 2 ) > /(rq) + /(712). A special case of the theorem is that I(mn ) < /(m) + I(n). 

We are also now able to say something interesting about 2 fe -PPM: 

Corollary 2. For k = 1,2, • • •, the quantity I(2 k )/k is decreasing in k. 

Proof. Theorem 4 implies that 1(2) — /(l),/(4) — 1(2), 1(8) — 1(A),- ■■ is a decreasing sequence. 
Therefore, the average of the first k terms of the sequence is decreasing in k. But since /(l) = 0, the 
average of the first k terms is simply I(2 k )/k. □ 

Corollary 3. Fork G M, the bits-per-slot capacity of 2 k -PPM on a discrete-time memoryless channel 
is monotonically decreasing in k. 

Proof. The capacity of 2 fc -PPM in bits per slot is I( 2 k )/2 k . Thus, this result follows from (and is 
much weaker than) Corollary 2. □ 

A close look at Theorem 4 suggests that the following is likely true: The quantity (l(k + 1) — 
I(k))/(l°g(k + 1) — log k) is decreasing in k. Equivalently, the function I(k) is convex-fl when plotted 
as a function of log k. As of this writing, we have not proven this, so it is still a conjecture. Theorem 4 
would essentially be a special case of this result. The result would also imply more general versions of 
Corollaries 2 and 3: the quantity I(k)/\ogk would be decreasing in k for k > 2, and I(k)/k would be 
decreasing in k for k > 3. 


IV. Conclusions 

We have derived the capacity of a generalized PPM channel. The formulation applies to conventional 
PPM and multipulse PPM, among others. We showed that the capacity in bits per slot of conventional 
PPM decreases as the modulation order increases, a conclusion consistent with the decreasing average 
power of this sequence of modulations. 
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